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(N \ Abstract 

H— > . 

O . We give a description of gravitons in terms of an SL(2,C) connection field. The gauge-theoretic 

Lagrangian for gravitons is simpler than the metric one, in particular because the Lagrangian only 
, depends on 8 components of the field per spacetime point as compared to 10 in the Einstein-Hilbert 

^v^j case. Particular care is paid to the treatment of the reality conditions that guarantee that one is 

dealing with a system with a hermitian Hamiltonian. We give general arguments explaining why 
the connection cannot be taken to be real, and then describe a reality condition that relates the 
hermitian conjugate of the connection to its (second) derivative. This is quite analogous to the 
treatment of fcrmions where one describes them by a second-order in derivatives Klein-Gordon 
ijj ', Lagrangian, with an additional first-order reality condition (Dirac equation) imposed. We find 

' many other parallels with fermions, e.g. the fact that the action of parity on the connection is 

related to the hermitian conjugation. Our main result is the mode decomposition of the connection 
^vq ' field, which is to be used in forthcoming works for computations of graviton scattering amplitudes. 

> ' 

in 

1 Introduction 

O 

Work [lj showed that A / General Relativity (GR) can be described in the "pure connection" 
formulation, in which the only dynamical field of the theory is a (complexified) S0(3) ~ SU(2) 
{Sj ■ connection rather than the metrical Paper [3] made the first steps towards setting up the perturbation 
theory in this formalism. In particular, the usual propagating degrees of freedom of GR (gravitons) 
were exhibited, and the propagator obtained. It was also shown that the same formalism is applicable 
to a very large class of (modified) gravity theories describing, as GR, just two propagating polarizations 
. of the graviton. 

Here we develop this pure connection formalism for gravity further. This is the first in a series of 
papers aimed at studying how perturbative gravity can be described in this language. The principal 
aim of the present paper is to treat the linearized theory in the amount sufficient for later computations 
of e.g. graviton scattering amplitudes. However, interactions are considered only in the second paper 
of the series. 

In our treatment of the linearized theory particular attention is paid to the issues of the hermiticity 
of the arising quadratic Lagrangian. Indeed, as already mentioned, in the gauge-theoretic description 
of metric of Lorentzian signature one works with complexified SU(2), and thus SL(2,C), connections. 



1 Work [T] gave a gauge-theoretic description of a non-zero cosmological constant GR. Earlier works of Capovilla, Dell 
and Jacobson, see [5] and [3], provide a similar description of the A = case. However, the action principle proposed in 
these works contains an additional auxiliary field on top of the connection. There is no need for such a field when A / 0, 
which results in literally a "pure connection" formulation. 
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The Lagrangian then depends on the connection meromorphically, i.e. the complex conjugate of 
the connection field never enters. Such a description is only viable if some reality conditions are 
additionally imposed, and we discuss these in details in the present paper. Thus, our main results are 
the treatment of the hermiticity issues, as well as the related decomposition of the connection field into 
the modes. We also discuss the delicate issues of discrete C, P, T symmetries. The mode decomposition 
obtained in this paper gives everything that is needed for computations (performed in the second paper 
from the series) of graviton scattering amplitudes from the connection field correlation functions. 

Some aspects of our gauge-theoretic description of gravitons are quite unusual, and are therefore 
worth explaining already in the Introduction. To understand what is going on, it turns out to be 
particularly useful to use the language of (2-component) spinors. Before we explain how spinors 
appear in the pure connection description of gravity, let us remind the reader some very basic facts 
about them. 



1.1 Spinors 

We will necessarily be brief here, and send the reader to e.g. [5] for more details. We recall that a 
tetrad e is a map, at each spacetime point p, from the tangent space T p M to a copy of Minkowski 
space M 1 ' 3 : 

e : T P M -> M 1 ' 3 . (1) 

The pull-back of the Minkowski metric r\ on M 1,3 gives the spacetime metric. Using the index notation 
we can write g^ v = e 1 e^rju, where //-,... are the spacetime and /, . . . are "internal" indices, i.e. those 
referring to the Minkowski space Ai 1,3 quantities. The object rjjj is the Minkowski metric, for which 
we choose the signature (—,+,+, +). 

The spinors arise by introducing an identification between Minkowski vectors x and 2x2 (anti-) 
hermitian matrices 

. / x° + x 3 x 1 - ix 2 \ , . 

\ x 1 + ix 2 x° — x 3 J ' 

The Minkowski norm of x 1 is then expressed as the determinant of x: 

- (x ) 2 + (x 1 ) 2 + (x 2 ) 2 + (x 3 ) 2 = det(x). (3) 

It is then easy to see that the space of anti-hermitian matrices is preserved by the following action of 
the group SL(2,C): 

x^ 5 x 5 t, 9 GSL(2,C). (4) 

It is also clear that the above action preserves the determinant of x and thus the Minkowksi norm 
of the corresponding x 1 . This provides an identification between the group SL(2,C) and the Lorentz 
group SO (1,3): 

SO(l,3) ~ SL(2,C). (5) 

The 2-component spinors are then objects that realize two inequivalent fundamental representations 
of the group SL(2, C). Objects of one type, to which we shall refer as unprimed (using the GR 
terminology), transform simply as length 2 columns on which g € SL(2,C) acts by multiplication 
from the left. The objects of the second type (primed spinors) transform in a complex conjugate 
representation, and can be thought of as rows of length 2, on which g^ G SL(2, C) acts from the right. 
Let us denote the space of spinors of unprimed type by S + and that of the opposite type spinors by 
SL. Both spaces have an SL(2, (C)-invariant "metric", which is however anti-symmetric, so that the 
norm of every object is zero. 
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It is then clear that the matrix x is an object of a mixed type 

xeS + ®5_. (6) 

Let us formalize this by introducing an index notation x AA ' for the matrix x. Here A, A' = 1,2 
are the spinor indices, with an object of the type X A £ S+ referred to as an unprimed spinor, and 
X A £ S- as primed. Note that we can always identify the spinor spaces S± with their duals using the 
SL(2, C)-invariant metric. One must, however, be careful with the operation of raising and lowering 
of spinor indices, as this now introduces a minus sign (since the metric is anti-symmetric). We now 
write x AA := \\J29 AA x 1 , where we have introduced a matrix 9 AA which is the object that fixes the 
identification between Minkowski vectors x 1 and anti-hermitian 2x2 matrices x. The factor of y/2 
is introduced for future convenience (so that the expression for 6f A in terms of the so-called doubly 
null tetrad is simple). The objects 9 AA are hermitian: (9 AA )* = 9f A , where one also should take 
into account the fact that under the operation of complex conjugation the space of unprimed spinors 
goes into that of primed ones and vice-versa: 

(S+T = 5_. (7) 

We can finally combine the tetrad with the object 6f A ' just introduced to form a new object 
9 AA ' = e^Of^ that is referred to as the soldering form. This object provides an identification between 
the space S + S- of mixed rank two spinors and the tangent space to our spacetime manifold M 

9 :TM -»• S + ® S- (8) 

As e that is used in its construction, it also carries information about the spacetime metric. The 
soldering form can be used to construct the Dirac operator V AA := \/2 9 AA V M , where is the 
metric-compatible derivative operator, and we have raised the spacetime index on V using the metric. 
The Dirac operator, with its spinor indices raised or lowered appropriately using the SL(2, C)-invariant 
metrics on S± becomes a map sending spinors of one type into those of opposite type, e.g.: 

V : S+ -> S-. (9) 

We are now ready to discuss the spinorial interpretation of the objects that appear in our gauge- 
theoretic formulation of gravity. 

1.2 SL(2, C) connections 

The main dynamical field of our theory is a complexified SO(3) ~ SU(2) and thus SL(2, C) connection. 
Locally its is a one-form on M taking values in the Lie algebra g ~ sl(2) of the gauge group. We 
will always think about the Lie algebra as a complex vector space of dimension 3. In index notations 
the connection is denoted by A 1 , where i = 1,2,3 is the Lie algebra index. As we shall see in 
details below, when the action of the theory is linearized around a suitable background connection, 
the background field allows for a certain metric to be defined. So, the linearized theory is about 
infinitesimal connections that we denote by living on a metric background. The metric allows us 
to define the usual notions of tetrad and then the spinors, as discussed above. We will then see that 
the structures available in the background field allow us to identify the Lie algebra g with the space 
of symmetric rank 2 unprimed spinors 

g-si- (10) 

Indeed, as is well known, the Lie algebra sl(2) of the Lorentz group (viewed as SL(2,C)), when 
considered as a complex vector space of dimension 3, is isomorphic to the second symmetric power of 
the fundamental representation. The background field then identifies the Lie algebra g of the gauge 
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group of the theory with the Lie algebra of the Lorentz group 5/(2) acting in each tangent space, and 
this is why (fTUj) becomes possible. 

Also, as we have already discussed, the spacetime index of our infinitesimal connection one-form 
can be converted into a pair of spinor indices using the soldering form 0^ . Thus, overall, mapping 
all the indices of the infinitesimal connection into spinor ones we get an object 

a AA'BC eS 2 + ® S+ ® S _ 

Thus, our linearized theory is about fields living in the above spinor representation. This should be 
contrasted with the usual metric description where the metric perturbation field h au , when converted 
into the spinor form becomes 

h A A'BB' e (S+®S-)® S (£+<»£-), (12) 



where <S> S means the symmetric part of the tensor product. Both fields (jlip and (|12p are capable 
of describing a spin 2 particle (this follows just by counting the number of the fundamental spinor 
representations appearing, and multiplying the result by 1/2, which is the spin carried by the funda- 
mental representation). However, there is a profound difference between the two descriptions. The 
spinor space relevant for the usual metric description goes into itself under the operation of complex 
conjugation: 

((S+ ® S-) ® a (S+ S-))* = (5+ S-) ®s (S + S-). (13) 

However, the space in ([lip under the operation of the complex conjugation gets sent to a completely 
different space 

(Si ®S+®S*_)* = S 2 (14) 



This is why there are real objects in the space in ([12)1. but no real objects in the space in ([lip . In 
other words, the description of spin 2 particles is possible in terms of real fields if one uses fields such 
as hn U , but cannot be possible if one uses the connection field in ([lip . This is the first conclusion that 
can be made about our prospective gauge-theoretic description of gravity even prior to developing it. 
As a result of this basic fact, the issues of reality conditions and hermiticity of the Lagrangian will 
have to be dealt with in a way significantly more non-trivial than in the metric based description, see 
more on this below. 

Let us ignore the issues of hermiticity for the moment, and discuss how the diffeomorphisms, which 
are the fundamental gauge symmetries of any theory of gravity, can be represented in our formalism. 
In the usual metric language the diffeomorphisms act via 

<Vfyi/ = V^fj,), (15) 

where is the diffeomorphism generator. The important point about this transformation rule is that it 
involves the (first) derivatives of the generator. Therefore, the question of which components of hup are 
pure gauge is mode-dependent, and can be answered only after the metric perturbation is decomposed 
into modes via an appropriate Fourier transform. The space ()12p where metric perturbations live 
has dimension 10 (per spacetime point). The Hamiltonian analysis of gravity then tells us that 4 of 
the components of the metric perturbation field hau get the interpretation of Lagrange multipliers 
imposing 4 constraints. This removes 4 + 4 = 8 components, leaving only 2 propagating degrees of 
freedom of the graviton. 

Let us now discuss a similar count of degrees of freedom in our gauge-theoretic description. The 
first fundamental difference is, as we shall see in details below, is that the connection transformation 
rule under the diffeomorphisms is much simpler than (|15p . Thus, it turns out that the action of the 
diffeomorphisms is described by first decomposing the space in ([lip into its two irreducible components 

Si ®S+®S_ = Sl ®S-®S + ®S-, (16) 
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where we have used the elementary representation theory fact that <g) S + = © S + . One then 
finds that from the two parts of the connection arising this way, the part taking values in S+ (8) S— 
can be set to zero by an action of a diffeomorphism. In other words, S+ <£> S- is pure gauge, and we 
can describe the space of (infinitesimal) connections A modulo diffeomorphisms in full generality as 

A/dffieos = Si ®5_. (17) 

Importantly, this decomposition into a gauge and non-guage parts is mode-independent, and is possible 
already at the level of the Lagrangian, prior to any mode decomposition. This happens because it 
turns out to be possible to write the formula for the action of a diffeomorphism on the connection in 
a special way. Namely, in a gauge theory one has a freedom to talk about diffeomorphisms modulo 
the usual gauge transformations. Then one can write the formula for the infinitesimal diffeomorphism 
in such a way that it does not contain any derivatives of the generating vector field £. Explicitly, the 
action reads 

<K = ( 18 ) 

where F l v is the background curvature two-form. There are no derivatives of £ in this formula, 
and this is why the decomposition Q17|) becomes possible. Below we shall see that the way that the 
decomposition (|17p is realized at the level of the action is that the Lagrangian is simply independent 
of the S+ ® SL components of the connection. 

To summarize, in our gauge-theoretic formulation, the diffeomorphisms are much easier to deal with 
than in the usual metric description. The components of the connection that are pure (diffeomorphism) 
gauge can be projected out already at the level of the Lagrangian and the action becomes a functional 
on the 8-dimensional space (|17|) . On this space one still has the usual 5/(2) gauge symmetries acting, 
with 3 of the 8 components of the projected connection field in (|17p being Lagrange multipliers for 
3 constraints. At the end one gets the usual 8 — 3 — 3 = 2 propagating modes of the graviton, but 
in a way completely different from the metric description. As we shall see below, in our description 
one will only need to gauge-fix the usual si (2) gauge symmetry, like one would be doing in Yang-Mills 
theory. In contrast, in the metric description one has to gauge- fix the diffeomorhisms, which leads to 
an arguably more involved formalism. Also to be emphasized, in our gauge theoretic description one 
will be dealing with only 8 components of the field per point, while in the metric description one has 
10. Last but not least, as we shall see below, our gauge-fixed Lagrangian is actually a convex function 
in the field space, with all the modes having the same sign in front of their kinetic terms. This is not at 
all the case in the metric description, with one of the modes, namely the trace hp, having an opposite 
sign in front of its kinetic term as compared to the other modes. This is the infamous conformal mode 
problem of the Euclidean approach to quantum gravity. This problem is absent in the present gauge- 
theoretic formulation of gravity, with the Euclidean signature Lagrangian (when all the fields become 
real) being a non-negative (i.e. convex in non-flat directions) function in the field space. This fact, as 
well as other simplifications resulting from the possibility to project away the diffeomorphisms from 
the outset, should be viewed as the main reason for taking the present gauge-theoretic formulation as 
a serious alternative to the usual metric-based one. We refer the reader to |6] for a further discussion 
of the above points. 

1.3 Fermions 

Above we have seen that our infinitesimal connection field cannot be real, as it takes values in a space 
that does not go into itself under the complex conjugation. Of course, the full complex-valued field 
then describes twice more real modes than is needed (with the extra half of the modes coming from the 
complexification badly behaving). Thus, one does need to impose some reality conditions if one wants 
to get a satisfactory description of spin 2 particles. The way this happens turns out to be strongly 
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analogous to what happens in theories of fermions, i.e. spin 1/2 particles. Thus, let us briefly discuss 
the usual fermions in Minkowski spacetime first. 

A possible (and in fact rather powerful, but not commonly known) approach to fermions is to 
describe them by a second-order in derivatives action, treating the original Dirac first-order equation 
as a reality condition for the fermion field. This gives a completely equivalent description to the usual 
one, and can also be shown to lead to some simplifications in the computations of Feynman diagrams, 
see e.g. [7] for an emphasis of this fact. 

To describe this in some details, let us only discuss here the case of a single Majorana fermion, 
which is the simplest (and is also enough for our purposes of drawing an analogy). In the usual 
first-order Dirac like formulation this is described by the Lagrangian 

^Majorana = bfiX^tf* &>\ A ~ {m/2)\ A \ A - (m / '2) \\, tf A ' , (19) 

where A^, \\, are two anti-commuting 2-component spinors and Xa, is the hermitian conjugate of A^. 
The above Lagrangian is hermitian modulo a surface term, as can be checked by an easy computation. 

In the second-order description one integrates out the primed spinors X A , (using the fact that at 
the level of the path integral it is legitimate to treat A^, X A , as independent fields. To do this one uses 
the field equation for Xa, that reads: 

A tA' = b^e AA 'd»x A . (20) 

m p 

One then substitutes this back into (|19|) to obtain (after using some algebra of soldering forms) 

^■Majorana = ~~^~9 ,J 'X A d t j l X A — A^A^, (21) 

which is just the Lagrangian that gives the Klein-Gordon equation for each of the two components 
of X A . It can then be shown that the theory (|2ip supplemented with the reality conditions (|20p is 
completely equivalent to the original theory (|19p . Of course, the Lagrangian (|2ip is not hermitian, 
but instead depends holomorphically on the spinor field X A . It only leads to a theory with a hermitian 
Hamiltonian once the theory is restricted to live on the space of fields satisfying (|2U|) . There are some 
subtle points here about on-shell versus off-shell correspondence, and this will be further discussed in 
the main text, when contrasting with what happens in our gauge-theoretic description. 

It is worth discussing the reality condition (|20p from a more general viewpoint. Imagine we would 
like to start with (|2ip . and then find some appropriate reality condition that would give us a theory 
with a hermitian Hamiltonian. The spinor field A^ that we work with lives in the space and this 
space goes into S_ under the complex conjugation. Thus, the field cannot be taken to be real. We 
then need a more sophisticated real structure on the complex phase space of our theory, and this is 
provided by the Dirac operator. Indeed, the Dirac operator maps spinors of one type into those of the 
other. Thus, we can combine the action of the Dirac operator with that of the complex (hermitian) 
conjugation to define 

K:=— do], (22) 
im 

where d here stands schematically for the Dirac operator as we defined it above. The 7£-operator 
is an anti-linear map sending the space of unprimed spinors into itself. Importantly, it becomes an 
involution 1Z 2 = Id on the space of solutions of the theory (|2ip . and is thus a real structure on the 
phase space when the latter is viewed as the space of solutions of field equations. The reality condition 
([2"0]) is then just the condition selecting the real section of the phase space with respect to the real 
structure TZ. This gives an equivalent viewpoint on the usual theory of fermions that works with 
first-order hermitian Lagrangians, but also leads to some important simplifications in computations 
with fermions, as is emphasized in [7J. So, this is a valid viewpoint on the fermions. As we now 
discuss, gravitons in their gauge-theoretic formulation share many similarities with this description of 
fermions. 
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1.4 Reality for gauge-theoretic gravitons 

We now come back to the description of the gravitons as connections taking values in (|lip . or, after the 
diffeomorphism components have been projected away, in (|17|) . As we shall see, the resulting linearized 
Lagrangian on this space is a meromorphic function of the connection, leading to a second-order in 
derivatives field equation. Since the connection takes values in <8> S-, and this space is not invariant 
under the operation of complex conjugation, the connection cannot be real. However, we can now use 
the above second-order treatment of fermions as a guide, and device an appropriate reality condition 
that will make the Hamiltonian hermitian. 

The idea is to cook up an anti-linear map from the space S+ <8> S- into itself by combining the 
operation of the hermitian conjugation of the field with the action of an appropriate differential 
operator. The operator that we have at our disposal is the Dirac operator V. Note that we now 
work in a curved background, and so refer to the Dirac operator as V in contrast to d above. The 
importance of the curved background will be explained below. The Dirac operator converts one spinor 
index into the index of an opposite type. Thus, if we take the complex conjugate of an object in 

<8> S- we get an object in <8> S+. To convert this into an object in the original space ® S- 
we need to flip two of the spaces S- to become S+. Thus, we will have to apply the Dirac operator 
twice. In other words, a possible reality condition must be of the form 

K ~ V 2 o f. (23) 

We now note that in the case of the Dirac theory we had the mass parameter that allowed to make the 
dimensions match in (|22p. so that TZ is a dimensionless operator. For the graviton there is clearly no 
mass parameter that can be used, as the graviton is massless. It is for this reason that our description 
of gravitons only makes sense in a curved background, where the radius of curvature of the background 
can provide the missing dimensionful parameter. This provides yet another explanation of why the 
gauge-thereotic description of gravity only works properly when A / 0. Below we shall see that it is 
the mass parameter associated with the curvature M 2 ~ A whose inverse power will be sitting in (|23|) 
to make the dimensions match. We will also see that, on solutions of field equations, an appropriately 
designed anti-linear operator of the form (|23p becomes an involution, and thus defines a real structure 
on the space of solutions (=phase space). After the corresponding real section is selected, one obtains 
a theory with a hermitian Hamiltonian. In fact, as we shall also demonstrate, the corresponding 
complex description of the phase space of gravitons is just a (complex) canonical transformation of 
the usual phase space in terms of the metric perturbation. So, at the level of the (reduced) phase 
space the two descriptions will be shown to be completely equivalent. 

We summarize by saying that our gauge-theretic description (to be developed in the main text) 
is completely equivalent to the standard description at the level of the fully symmetry reduced phase 
space. However, the connection viewpoint on gravitons brings some important simplifications into 
the perturbation theory, as could be suspected from the fact that the theory now depends on less 
components of the field to start from (8 as compared to 10). A related fact is that in the gauge- 
theoretic description the field takes values (after the diffeomorphisms have been dealt with as in 
(|17|)) in an irreducible representation 5^ (g) S- of the Lorentz group. This is in contrast to the 
usual description, where one must build up the perturbation theory working with all the components 
of the metric perturbation. These split into two irreducible components S+ <8> and the trivial 
representation (functions on spacetime). The two irreducible components behave very differently, 
and part of the complexity of the standard perturbation theory consists in dealing with these two 
different components. This problem is absent in our treatment, and will be seen to result in many 
simplifications in the formalism. 

Now that we have explained the main unusual points of our construction, we can start with 
our development of the diffeomorphism invariant SO (3) ~ SU(2) gauge theory, which will be shown 
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to describe gravity. We start with a formulation of the theory in Section [2j We then discuss the 
background and obtain the linearized Lagrangian in Section [3l The resulting free theory is described 
in details in Section [H where also the Hamiltonian analysis is performed. Section [5] is central for the 
whole story and discusses the subtle points related to the reality conditions. It also introduces the 
metric variable, in terms of which one has the familiar dynamics. Section [6] shows that the passage 
to the metric variable is a canonical transformation on the phase space of the theory. The mode 
decomposition is obtained in Section and then the discrete symmetries are discussed in Section [SJ 
We conclude with a discussion. 

2 The theory 

The contents of this section are not new. Some more details on diffeomorphism invariant gauge 
theories described below can be found in [6]. General Relativity (with A ^ 0) was first formulated in 
this language in [T]. 

2.1 Diffeomorphism invariant gauge theories 

We begin in full generality, and define a large class of what can be called diffeomorphism invariant 
gauge theories for an arbitrary gauge group. Thus, let G be a (complex) Lie group, which we for 
simplicity assume here to be simple. Consider a G-connection on the spacetime M. Locally it can 
be described as a one-form A 1 with values in the Lie algebra g of G. Thus, here and in what follows 
/ = 1, . . . , n is the Lie algebra index. The curvature of the connection is a two- forms with values in g 
that can be described as 

F i = d A I + lf I JK A J AA K , (24) 

where f 1 jk are the structure constants. 

Now let / be a scalar valued function acting on symmetric matrices in g ® s q: 

f: Q ® s Q^C. (25) 

We require this function to satisfy two properties: (i) It must be gauge invariant f(Ad g X) = 
f(X),Vg £ G; (ii) It must be homogeneous of degree one f{aX) = af(X),\/a ^ 0. Both condi- 
tions are required to hold for any X £ g s g. 

Having such a function, it is not hard to see that it can be applied to the quantity F 1 A F J , with 
the result being a well-defined 4-form. Indeed, F 1 A F J £ A 4 (g) g ® s g, i.e. it is a 4-form with values in 
the space of symmetric matrices. We can apply the function / to it, and the result is gauge- invariant 
due to the gauge-invariance of /. At the same time, the 4-form factor can be just "taken out" from 
the function due to its homogeneity, and so one gets a well-defined 4-form. Integrating this over the 
manifold one gets the action 

S[A]=i [ f(F A F). (26) 

J M 

Several remarks about this action are in order. First, the factor of i = y— T is introduced for future 
convenience. Second, there are no dimensionful coupling constants in our theory. Indeed, there are 
only dimensionless parameters involved in constructing the function /. All the dimensions are carried 
by the fields, so that the connection A has the mass dimension one, and the curvature has the mass 
dimension 2. The Lagrangian then has the required mass dimension 4 by the homogeneity of /. 
Below we shall see that the dimensionful coupling constants get introduced into this theory when a 
suitable background is selected (as combinations of the mass scale of the background with the other 
dimensionless parameters present in /). 
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Another remarks about (|26p is that its field equations are of the second order in derivatives. This 
is easy to see if we write the equations as 

d A S 7 = 0, where B 1 := ^jjF J ', (27) 

and where the matrix X IJ = F 1 A F J . As we shall see below, the matrix of derivatives of / with 
respect to is a well-defined matrix- valued function (not a form) of homogeneity degree zero acting 
on A 4 ®$j(g> s 0. Thus, the quantity B 1 is a well-defined 2-form with values in the Lie algebra. The field 
equations are then just a statement that B 1 is covariantly constant with respect to the connection A. 
Let us now count the number of the derivatives appearing in the equation ()27p . The function /, as 
well as the matrix of its first derivatives, are (highly non-linear) functions of the first derivatives of A. 
Then another derivative is taken in (|27p . which results in second-order field equations. 

Our last remark about (|26|) is that for a generic / they are dynamically non-trivial theories, i.e. 
describe propagating degrees of freedom. The clause about generic / is important, for there is one 
point in the theory space corresponding to f{F A F) = Tr(F A F) which gives a topological theory 
without any propagating modes. But this is clearly a very special point in the theory space because, 
as we shall see below, whenever the Hessian of the function / is non-degenerate there are propagating 
modes. For a generic / it can be shown by a Hamiltonian analysis, see [8] for such an analysis in a 
different, but related description, that the theory (|26|) describes 2n — 4 propagating modes. 

2.2 Gravity 

It turns out [I] (and this will be shown below) that when one takes G = SL(2,C), viewed as a 3- 
dimensional complex Lie group (i.e. as a complexification of SU(2)), the above theory describes, for 
any choice of the defining function /, interacting massless spin 2 particles. This statement does not 
take into account the reality conditions issues, as discussed in the Introduction. In other words, we 
do not know if there is a choice of the reality conditions that render a theory with arbitrary / to have 
a hermitian Hamiltonian. However, what we will show in this paper is that, when linearized around 
an appropriate background (which is going to be just de Sitter space in the language of connections), 
all theories ([26]) with G = SL(2,C) lead to the same linearized dynamics. This dynamics is that of 
massless spin 2 particles, and then (linearized) reality conditions can be imposed to yield a positive- 
definite hermitian Hamiltonian. Thus, there is a satisfactory treatment of the reality conditions issue 
at the linearized level for any /. Whether this can be extended to the full non-linear level is an open 
problem, apart from the case of / that corresponds to GR, where the correspondence to GR implies 
that there is a satisfactory solution to the reality conditions problem. 

2.3 General Relativity 

General Relativity with a non-zero cosmological constant can also be described in this language, and 
is just a particular point in the theory space (|26|) . In this case the action reads, see [T| 

where G is the usual Newton's constant, A is the cosmological constant, i = and F" 1 = dA l + 

(l/2)e l i k Ai A A k is the curvature of A 1 . Due to the presence of the factors of imaginary unit in front 
of the action, and also because of the fact that the connection is complex (reality conditions will be 
describe below), it is not obvious that this action describes a theory with unitary dynamics. Still, as 
we shall see in particular from the graviton scattering results (in the second paper from the series), it 
describes the usual general relativity. An argument establishing equivalence to the usual metric based 
GR at the full non-linear level is given in pQ. Thus, we know for sure that at least for one of the 
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members of the class (|26p the issue of reality conditions at the full non-linear level can be dealt with 
satisfactorily (by going to the usual metric-based real description). 

The square root of a matrix appearing in the action has to be understood perturbatively, as we 
shall explain (and explore) below. Note that the Newton's constant appears in front of the action 
only in the dimensionless combination GA. This is of course also possible in the usual metric-based 
formulation if one rescales the metric to absorb A into the volume factor y/—g. The metric then 
becomes dimensionful and A appears in front of the action exactly as in (|28|) . Our final remark about 
(|28p is that it gives only an on-shell equivalent formulation of general relativity, while off-shell the 
action (|28|) has different convexity properties from the Einstein-Hilbert one. This is of no importance 
for the present and the second paper from the series, where only the tree-level scatting amplitudes 
are studied, since these can be expected to be the same as in GR. However, one should be cautious 
when comparing the (to be constructed) quantum theory based on ()28p with the one based on the 
Einstein-Hilbert functional. Even though the phase spaces of both theories are the same (viewed as the 
spaces of solutions of field equations), there can be subtleties (e.g. in the measure) when comparing 
the path- integral based quantum theories. We do not touch these issues any further in the present 
work. 

3 Perturbative expansion 

The treatment of the background below is along the lines of 0. A more in depth discussion of the 
mass scale introduced by the background in available in [6J. The perturbative expansion of the action 
is to a large extent new, with only a very preliminary discussion available in [4J. 

3.1 The background 

We are (eventually) interested in developing Feynman rules for the theories (|26p . and, in particular, 
for (|28p . One immediate difference with the case of the metric-based GR is that we cannot directly 
expand around a background that corresponds to the Minkowski spacetime. Indeed, our action ()28p . 
strictly speaking, only describes the A ^ situation, as it blows up if one sends A — > 0. Thus, the 
best we can do (if we are after the Minkowski spacetime scattering amplitudes) is to expand around 
a constant curvature background and take the curvature scalar to zero at the end of the calculation. 
This is the strategy that will be followed here (and was previously followed in [4]). As we shall see 
below, the presence of the cosmological constant at intermediate stages of the computations will make 
to us available constructions that are simply impossible in the usual metric setting of zero A. 

We shall consider perturbations around a fixed constant curvature background connection. To 
explain what constant curvature means in our setting let us start by describing a general homogeneous 
and isotropic in space SO(3) connection. First, a general homogeneous in space connection is of the 
form 

A* = a ij (rj)dx j + b i (r J )dr I , (29) 

where we have indicated that the components can only be functions of the time coordinate rj. It is 
obvious that we can kill the b l (rj) components by a time-dependent gauge transformation. This leaves 
us with the first term only. We now require that the effect of an SO (3) rotation of the coordinates x l 
(around an arbitrary center) can be offset by an SO(3) gauge transformation. This implies that a 13 
must be proportional to S 13 for all ij. Thus, we are led to consider the following connections: 

A i = ^-dx\ (30) 
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where the function c{rj) is arbitrary, and we have introduced a factor for 1/i for future convenience. 
We now note that the curvature of this connection is given by 

F i = jdr] A dx i - je ijk dx j A dx k , (31) 

where the prime denotes the derivative with respect to r]. This means that we have 

F*AF i ~S ii . (32) 

Thus, for our chosen background (|30p the matrix X 13 is proportional to the identity matrix, which 
means that the matrix of first derivatives of the function f(X) is also proportional to the identity. 
This implies that any connection (|3U|) satisfies the field equations following from (|26|) 

: ' <33) 

as these equations reduce to the Bianchi identity D^F 1 = 0. This happens for any /, i.e. for any of 
the theories in our theory space. 

We now note that the curvature (1311) can be written as 



F { = -c 2 (^dv A dx i + ^e ijk dx j A dxA . (34) 



We can now choose the time coordinate conveniently, so that 



c' 

-zdrt = dt, (35) 
c z 



and then write 

(36) 



F i = -c 2 (idt A dx l + ^e ijh dx j A dx k ^j , 



where c should now be thought of as a function of t. In fact, from (|35p we have dc/dt = c and thus 

<t) = (37) 
t — to 

where to is the integration constant. All in all, we see that, by an appropriate choice of the t coordinate, 
we can rewrite the curvature of any of the connections (|30p as 

F i = -M 2 T,\ (38) 

where 

E i = a 2 (idt A dx i + ^e ijk dx 3 A dx k ^j (39) 
are the self-dual two-forms for the de Sitter metric 



ds 2 = a 2 



-dt 2 + ^2(dx^ , (40) 



and 



a{t) = -mr^ (41) 
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is the usual de Sitter scale factor as a function of the (conformal) time t. Note that we have introduced 
an arbitrary dimensionful parameter M in (|38p . This parameter is directly related to the radius of 
curvature of the de Sitter metric (|40p . It is completely arbitrary, as we can always rescale both M and 
E* in (|38p without changing the curvature. But once introduced, it determines the metric, and thus 
determines how all scales in the theory are measured. The condition (|38p . which as we saw can be 
always achieved by choosing the time coordinate appropriately, is our constant curvature condition for 
the background connection. The essence of this condition is that it introduces a (background) metric 
into our background-free up to now description, and fixes how all scales are measured. 

It is worth discussing the construction that introduced a metric into our so far metric-free story 
in more details. This is a geometrical construction known for many years, and is in particular due to 
[10] . The idea is that when the triple of curvatures F l of the connection A 1 is linearly independent, 
the 3-dimensional space that it spans in the space of all 2-forms can be declared to be the space of 
self-dual 2-forms for some metric. It is then known that this determines the metric modulo conformal 
transformations. This is precisely how the metric (|40p appeared from the background connection (|30p . 
We have also made a further choice of the conformal factor by so that the connection becomes one of 
constant curvature in the sense of equation (|38p . Fixing M in that equation to be constant eliminates 
the conformal freedom in the choice of the metric, up to constant rescalings. A choice of a particular 
constant M 2 in that equation is then equivalent to a choice of units in which all other quantities in 
our theory are measured. In this sense M is not a parameter of the theory, it is rather a scale in terms 
of which all other scales in the theory get expressed. Thus, e.g. in the second paper in the series we 
shall see how the gravitons' interaction strength (Newton constant) appears as constructed out of M 
and the dimensionless coupling constants present in our theory. 



3.2 Working with functions of matrix-valued 4-forms 

We should now explain how a function (e.g. the square root in (|28p ) can be applied to forms. We do this 
in a way most convenient for practical compuations. Thus, it is convenient to use a completely anti- 
symmetric density e^ vpa available without any metric to construct the following densitiezed matrix: 

* ij = \^F; V F^. (42) 
The general action (|26|) for G = SL(2, C) then bomes 

S[A]=i [ d 4 xf(X ij ). (43) 



One can now see that the integrand is a density weight one scalar, and so the integral is well-defined. 
The field equations then take the form 

where the matrix of first derivatives that appears is now just that of usual derivatives of a function of 
a matrix with respect to the matrix components. For GR action ([28]) written in terms of X tJ we have: 



iM : 



Sgr[A] = ^ J d A x (TrVx) . (45) 

Here we have introduced M 2 := l/16vrG,M 2 := A/3. What we have now is the square root of 
a symmetric 3x3 matrix, and this is well-defined (at least for matrices that are not too far from 
the identity matrix). The action in the form (|45|) will be our starting point for developing the GR 
perturbation theory (in the second paper from the series). 
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3.3 A convenient way to write the action 



Let us now consider the value of X 13 at the background. We have 



M 4 

X ij ± _ gwp* 5.^^ = 2iM 4^ ; (46) 



where our convention is that the hat means " evaluated at the background" . Here we made use of the 
self-duality of E's and the algebra (|155p of S's. It is very convenient to rescale the X variable by 
2iM 4 ^/— g so that the result equals to the Kronecker delta on the background. Thus, we introduce: 

X' J 

X 13 := --^ = 5 l K (47) 

2iM 4 ^/^ v 1 

We now rewrite the general gravity action (|43[) in terms of X. We have: 

S[A] = -2M 4 J d 4 aV=^/(X^'). (48) 
For the GR action ()45[) this becomes: 

S GR [A] = -\MlM 2 J d 4 x^g (irv 7 !^) 2 . (49) 
It then becomes a simple exercise to compute the variations of the action, see below. 



3.4 Evaluating action at the background 

Let also discuss the value of the actions (|4"8j) and (f4U|) when evaluated on the background. We have, 
for the general action: 

S[A]= -2M A f(5) J d 4 x^g~. (50) 

For (|49p this becomes 

S GR [A]= - 6M p 2 M 2 J d^x^-g = J d 4 x^ , (51) 

which is the same as the value of the Einstein-Hilbert action 

SehIq] = -J^q J d * x V=g (R - 2A) (52) 

evaluated on the de Sitter metric (|40p . We see from (|50p that for a general theory the dimensionless 
quantity f(5) plays the role of a combination 3M 2 /M 2 in the case of GR. We emphasize, however, that 
for a general theory there is no notion of the Planck constant, at least not until graviton interactions 
are considered. In the second paper of the series we compute the graviton interactions strength and 
will extract an appropriate dimensionful coupling constant this way. It is however, not guaranteed that 
the Planck mass obtained from this Newton constant will be related with the dimensionless parameter 
f(5) in front of the background-evaluated action in exactly the same way as in GR. 
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3.5 Variations 



We start by computing the variations of X, as a function of the connection, evaluated at the background 
=5 ij . We have: 

6X^= -^E^D^SAl\ (53) 
5 2 XV = J^e^D^AiDpSAl - e^ kl 5 A^ A l u , 

5 3 X^ = r^e^D^A^e^SApA^. 

Finally, the fourth variation is zero 5 X tJ = even away from the background. In all expressions 
above is the covariant derivative with respect to the background connection. Thus, it is important 
to keep in mind that D's do not commute: 

2D {ii D v ^ = i^Fl v V\ (54) 

for an arbitrary Lie algebra valued function V % . Here is the background curvature (|38p . Thus, 
the commutator (|54p is of the order M 2 . This has to be kept in mind when (in the limit M — > 0) 
replacing the covariant derivatives D with the usual partial derivatives. 

3.6 Variations of the general action 

We will now explain a procedure that can be used for computing the perturbative expansion of the 
action (|48p . It is completely algorithmic, and is not hard to implement to an arbitrary order. In this 
paper we will only need the second variation, but we decided to explain the general procedure already 
here since once the general principle is understood, it is not hard to implement to get the interactions 
as well. First, les us define a convenient notation 



(n) _ d n f 

dX^dX kl ... 



f 

J ijkl 



where the derivatives are all evaluated at the background X lJ = b 13 . The variations of the action are 
then given by: 

5S= - 2M 4 J f^5X ij , 5 2 S = - 2M 4 J \jlf kl 5X ij 5X kl + f^5 2 X ij ^ , (55) 



5 3 S = - 2M 4 



+4f® l 6 3 X i >8X kl + Zflf kl 5 2 XV8 2 X kl 

Below we shall explain how the derivative matrices appearing here can be parameterized conveniently. 
However, let us first consider the special case of the GR action. 

3.7 Variations of the GR action 

For the case of GR we have 

M 2 / nr\ 2 

/GR = ^ Tr (VX) , (56) 
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The variations are now easily obtained by defining Y = yX, and writing 

S GR [A] = -^M 2 M 2 j (Try) 2 , (57) 

where we have dropped the integration measure d^x^J—g for brevity. The variations are then easily 
computed: 

5S GR [A] = ~M 2 M 2 j 2 Tr (Y) Tr {SY) , (58) 
5 2 S GR = — ^M 2 M 2 J 2 [ Tr (5Y) Tr {SY) + Tr (Y) Tr (5 2 Y)] , (59) 
S 3 S GR = —^MpM 2 j 2 [3 Tr {SY) Tr (5 2 Y) + Tr (Y) Tr {5 3 Y)] , (60) 
S 4 S GR = — ^M 2 M 2 J 2 [3Tr (5 2 Y) Tr (5 2 Y) + 4Tr (SY) Tr (J 3 Y) + Tr (Y) Tr (5 4 Y)] . (61) 

It thus remains to obtain a relation between the variations of Y and those of X. This is easily 
done by varying the relation Y 2 = X (any required number of times), and then solving the resulting 
equations for 5 k Y. We only need these variations on the background, where we have Y % ^=SK This 
procedure gives: 

5Y= l -5X, (62) 



(63) 



5 2 Y=-5 2 X - 5Y5Y = -[5 2 X- -5X5X ] , 
2 2 V 2 J ' 

<5 3 Y = -5 3 X - -<5Y5 2 Y - -5 2 Y5Y = -5 3 X - - (8 2 X5X + 6X5 2 X - 5X5X5x) , (64) 
2 2 2 2 8 V / 

S 4 Y = -25Y5 3 Y - 25 3 Y5Y - 65 2 Y8 2 Y. (65) 



The above results can be put into the general form (|55p by writing: 



(3M 2 /M 2 )f^ = 3<%, (66) 



m 2 /M 2 )fg = --P m , (67) 

(3M 2 /M 2 )/^ mn = - Ty PijabPklbcPmnca + {^ijPklmn + &klPijmn + ^mnPijkl) (68) 

perm 

(3M /Mp)f^1 lmnpq = — — ^2 JjPijabPklbcPmncdPpqda + ^ ^ -^PijklPmnpq + 

where 



ijao^ kloc^ mnca-*- pqaa i g / ^ ^ 
perm ' perm 



Pijkl ■= ^ikSjl + SuSjk) ~ ~Sij5kl (69) 

is the projector on the symmetric tracefree matrices, and the dots in the last formula stand for terms 
containing at least one 5{j in one of the 4 external "legs". The sum over permutations in the last 
two formulas is needed to make the result on the right-hand-side symmetric. Eventually we are going 
to contract f^ 3 \ with copies of the same matrix 8X^ , and this sum over permutations (with the 
associated combinatorial factor) will disappear. Also, the reason why we don't write the remaining 
terms in the expression for is that (in the second paper from the series) we shall see that these 
terms will not play any role (in the 4-vertex) as they will be killed on-shell by the external states, or 
killed by the symmetries of the propagator when the vertices are used in Feynman graphs. 
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3.8 Matrices for a general / 

For the case of a general theory we can to a large extent fix the derivatives of the function / evaluated 
at the background X lJ = S 13 from the properties of / itself. Thus, we know that / is an SO(3) 
invariant function. The background that we work with is also SO(3) invariant. Thus, the same will 

(n) 

be true for the matrices f^ kl . This, in particular, implies that the matrix of first derivatives must be 
proportional to 5ij. The proportionality coefficient can then be fixed from the homogeneity property 
of / that implies 

d f xij = f ( 70 ) 



dX* 
Thus, we have 

Slf = (Tl) 

We also know from (|50p that f{5) is the analog of the parameter 3Mp/M 2 in GR for a general theory. 
We can now differentiate the equation (|7U|) once with respect to X 13 to obtain 

J-K X» = 0. (72) 
dX^dX* 1 

In other words, the background itself is among the flat directions of the Hessian of /. This, together 

f (2) 
lijkl 



(2) 

with the SO(3)-invariance of the matrix f^-l, implies that it is of the form 



fijkl ~ o^^' 

where g is some parameter and Pijki is the projector (|69p introduced above. This must be true for any 
/. Note that this is also true for the function f(X) ~ Tr(X), i.e. for the topological theory, but in 
this case we have g = 0. We shall see that there are propagating degrees of freedom whenever g ^ 0. 
Finally, we note that we have put a minus sign in (|73p because there is one in the case of GR, see 
(|67p . It is natural to be interested in theories that are not too far from GR, and so it is natural to 
have the same sign in (|73p as in GR. For this reason we shall assume g > in what follows. 

(n) 

The higher derivatives f-j can all be determined in a similar fashion. Thus, one takes higher and 
higher derivatives of the equation ()70p and evaluates the result on X %3 = 5 lJ . One gets 

f( n ) fjinjn , I _ 2) f («-!) _ n (JA) 

which is a recursive relation for the matrices of derivatives. We see that the new independent term 
that appears at each order is always of the form of n projectors (|69p contracted with each other in a 
loop, with a symmetrization over index pairs ij later taken to form a completely symmetric expression. 
There are also terms where the projectors are contracted in smaller groups. Thus, we can write 

f( n ) - (_i)n-l (n) ST^ }_p. . p. . p. . , 

J iljii2j2...i„j„ ~ V I il / j n \ r nn a nai r 12320-10.2 ■ • ■ r t n ]na n -\a n T" • • ■ , 

perm 



where the dots denote terms that contain smaller groups of P contractions, as we as terms that do 
not vanish when contracted with <5j,- in one of the channels. The coefficients in front of these latter 
terms are related to the lower g( n ' via (|74p . For example, for /( 3 ) we have 



fijklmn — 9^ ^ ^ ] ^ Pijab-Pklbc-Pmnca "H g {^ijPklmn ~\~ ^klPijmn "f" &mnPijkl) j (^) 



perm 
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where g = g^ . For the matrix of fourth derivatives we have 

f(4) _ (4) ST ]_p p p p , £(4) lp p ( 77 \ 

•> ijklmnpq " / ^ r ijab r klbc r mncd r pqda ' ij / ^ r ijkl r mnpq ■ ■ ■ j v'J 

perm ' perm 

where the other terms contain at least one factor of S and are not going to be important for us. Thus, 
the above parameterization of the derivatives of / makes it clear that for a general theory there is an 
infinite number of independent coupling constants g = g^ 2 \g^ , ■ ■ ., with a number of new couplings 
appearing at each order of the derivative of the defining function. In turn, we could have chosen 
to parameterize / by its independent couplings g^ n > . We (again) note that all these couplings are 
dimensionless. 

We would like to emphasize that the procedure used to obtain the action variations is completely 
algorithmic and can be continued to arbitrary order without any difficulty. 



4 Free theory 

The linearized action worked out below first appeared in [2], where also the Hamiltonian analysis (in 
the Minkowski limit) is contained. The novelty of this section is in the extension to the analysis to the 
more non-trivial de Sitter background. Also, the very compact form (|10ip of the completely symmetry 
reduced action is new. The most important new aspect of this section is in the realization that the 
connection cannot be taken to be real. This is invisible in the Minkowski version of the linearized 
action analysed in the previous works. Thus, our treatment of the reality conditions corrects and 
supersedes what appeared earlier in [1] and [6j. 

4.1 Linearized Lagrangian 

In this paper we only consider the linearized theory. The second order action (obtained as 1/2 of the 
second variation) reads: 



5(2) = J Ip^^D^dAl^DpSA^ - ^ (je^D^SAlDpSAi - M 2 ^ v e ijk 5A^5A 



We first note that we can integrate by parts in the second term, with the result canceling the last term 
precisely. One uses (|54p to verify this. The integration by parts is justified on connection perturbations 
of compact support (in both space and time directions), and this is what we assume. Let us also absorb 
the prefactor —g into the connection perturbation and define a new (canonically normalized as will 
be verified later) field 

^ := ^5A^. (78) 
The free theory Lagrangian takes the following simple form: 

£ (2) = -\Pw^ VD ^ kpaD P *l (79) 
In this section we study this theory in some details. We start by listing the symmetries of the theory. 

4.2 Symmetries 

The free theory f)79f) is invariant under the following local symmetries: 

Vt = V (g au § e )' %<i = t a tf ia (diffeo). (80) 
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Note that the action of diffeomorphisms in this language is very simple, and corresponds to mere 
shifts of the connection in some directions. The first formula here is the usual action of the gauge 
symmetry. The second formula follows by writing the action of diffeomorphisms (modulo a gauge 
transformation) on as (|18p . and then using the equation (|38p for the background curvature. The 
vector field appearing in (J80|) is then an appropriately rescaled one (by M 2 ) as compared to (fTS]) . 

The invariance under the usual gauge rotations is easy to see using the result for the commutator of 
two covariant derivative (|54|) and then the algebra (|155|) of S-matrices. To verify the invariance under 
diffeomorphisms we use the fact that D^T, l u ^ = (this follows from (j38|) and the Bianchi identity for 
the curvature). Writing this identity as 

D[ P K]a = -\Dc£p* (81) 
the variation of the Lagrangian (j79[) becomes: 

^£(2) = -P ijkl E^D^ai f-is^ZkEjJ . (82) 

Here we have used the fact that in the term where the covariant derivative acts on the £ field and 
the £ matrix is taken outside of the sign of the derivative, the algebra of the S-matrices gives an 
expression that is either anti-symmetric in 8 kl or a pure trace. Both are killed by the projector Pijki, 
and so only the term present in brackets in (|82p remains. But now we note that the expression in the 
brackets can be replaced with 

-\t a D a (s^SjL) 

in view of the /c^-symmetrization implied by the projector. This expression, however, is proportional 
to the covariant derivative of the Kronecker 5 in view of the algebra satisfied by S's, and this is zero. 
This establishes the invariance under diffeomorphisms as well. 



4.3 Hamiltonian analysis 

We now follow the textbook procedure of the Hamiltonian analysis of (|79p . to prepare the theory for the 
canonical quantization. Unlike what was done in [3] we would like to remain in de Sitter background 
and not take the M — > limit, at least not at this stage. We shall see that many subtleties, including 
those of the reality conditions, can only be understood for a non-zero value of M. So, we live in the de 
Sitter space (|40|) , with the self-dual two- forms given by (|39[) . We will also need a convenient expression 
for the background connection (|30p . and this is given by 

4 = ^(dx% = (H/i)(dx%, (83) 

where the prime denotes the (conformal) time derivative and we have introduced % = a' /a. The 
equation ([38]) . which is just the Einstein equation(s) in our language, then states H' = H 2 = M 2 a 2 , 
with the solution being a(t) = — 1/M(t — to), where to is an arbitrary integration constant. 

We now compute the quantity Y^^D^ai in terms of the temporal clq and spatial a\ components 
of the connection. We get: 

a^^D^al = -id t a ij + LD'oj + e ikl D k ai (84) 
where Di is the covariant derivative with respect to the background connection ()83p . Explicitly 

D k aj = d k aj - YHe^aJ 1 , (85) 
where we have used (|83p. The convention in (|84p is that the first index of a lJ is the spatial one. 



18 



We now decompose the spatial connection in its irreducible components 

a ij = ~ a a + e ijk c k + § ij c ^ (86) 

where a 13 is the symmetric tracefree component (i.e. spin 2). We substitute this into (JMJ) and 
immediately find that the spin zero component c gets projected away by the projector Pijki that 
multiplies this quantity in the Lagrangian. Keeping only the symmetric tracefree parts we get 

a 2 pY i. ^ Dfia l = _ idtS ii + id i (oj + ie 7 ' ) + e M d k aj + m~a ij . (87) 

We see that the dependence on the anti-symmetric part c l can be absorbed into a shift of the temporal 
part. We therefore see that only the spin 2 part a* J of the spatial connection is dynamical. We drop 
the tilde from now on. The conjugate momentum to a* J is 

tt« = d t a ij - Pd i (4 + + iB ij - Ua ij , (88) 

where we have introduced the "magnetic" field = Pe^ ikl dka{ , where P everywhere is the symmetric 
tracefree projector. The action in the Hamiltonian form becomes: 

S( 2 ) = I dt J d 3 x (ir tj d t a ij - H) , (89) 

where the Hamiltonian density is 

H = ^ - , - mjBV + Hnjj' ~ (ah + ic' , . (90) 

We have integrated by parts in the Gauss constraint term. Note that all instances of the conformal 
factor a have cancelled from the action. Indeed, we had a factor of a 4 coming from the measure \/—g, 
as well as a factor of a~ 2 twice coming from E's with the raised spacetime indices. 



4.4 Gauge-fixing 

It is convenient to fix the gauge at an early stage, and work with only the physical propagating modes. 
We see that the variation of the action with respect to the Lagrange multiplier a l gives the Gauss 
constraint 

diir ij = 0. (91) 

This constraint generates gauge transformations 

Sct^Pdfa, (92) 

where the projection is taken onto the tracefree part. This action can be used to set to zero the 
transverse part of a lJ : 

d ia ij = 0, (93) 

which is our gauge- fixing condition. Thus, our dynamical fields are a pair (a^vr*- 7 ) of symmetric 
traceless transverse tensors, as is appropriate for a spin 2 particle. We now note that the quantity 
e lkl dka\ is automatically symmetric tracefee and transverse on a l i that are symmetric tracefree and 
transverse. Thus, the projector in the definition of B 13 can be dropped. 
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4.5 Convenient notation 

The first-order differential operator a l i — > e^dkaj acts on the space of symmetric tracefree transverse 
tensors. It will appear on many occasions below, and so it is convenient to introduce a special notation 
for it 

(eda) ij := e ikl d k a{. (94) 

It is then not hard to show that 

{ed) 2 = -A. (95) 
It is also not hard to see that e<5 is self-adjoint with respect to the scalar product 

(x,y) = J ,f'. r.r'iji j (96) 

on the space of symmetric tracefree transverse tensors x lJ ,y l:l . Then, using the self-adjointness and 
(|95p we can write the Hamiltonian as 

H = ^7T 2 -m(eda + ma), (97) 
where we omitted the indices for brevity. 

4.6 Evolution equation 

Let us introduce two first order differential operators that are going to play an important role below. 
We define 

D := -idt + ed + m, D :=id t + ed + m, (98) 

where D is clearly the adjoint of D with respect to scalar product that also involves the time inte- 
gration. We note that Da is essentially the projected quantity a 2 PT l l > lu D^ai, with the gauge-fixed 
spatial connection and its conjugated momentum satisfying the Gauss equation. 
The Hamiltonian (|97|) then results in the following Hamilton equations 

- ivr = Da, Dtt = 0, (99) 

which immediately give 

= DDa = d 2 a - Aa + 2iHeda - 2H 2 a (100) 

as the evolution equation. Because of the term with ed that has a factor of i in front, this equation 
is complex. It becomes a non-trivial problem to choose a reality condition that is compatible with 
the evolution. Indeed, the naive reality condition that a %3 is real is not consistent with the evolution, 
because if one starts with a real a 13 , the evolution will generate an imaginary part. Thus, a more 
sophisticated strategy for dealing with this problem is needed. 

4.7 Second-order formulation 



Let us rewrite the original action (|79p as a functional on the space of symmetric tracefree transverse 
tensors a lJ . This can also be obtained by integrating out the momentum variable. Using the operators 
(|98p the corresponding second-order action can be written very compactly as 

S( 2 ) = -^JdtJ d 3 x{Da) 2 , (101) 

with (jlOOp following immediately as the corresponding Euler-Lagrange equation. 
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5 Reality conditions 



Our treatment of the connection field reality conditions in this section is new. This analysis constitutes 
one of the most important new results of this paper. 

5.1 Evolution equation as an eigenfunction equation 

For our later purposes, it is very convenient to write the evolution equation (jlOOp in a slightly different 
form. Thus, we use the fact that 

{D,D} = 2n 2 , (102) 
which easily follows from %' = H 2 , and write the evolution equation as an eigenfunction equation 



Ea = a, where E = -^DD. (103) 



This is the form that is going to be most useful below. 
5.2 An important identity 

We now prove an identity that lies at the root of the reality condition that is going to be imposed. 
First, we note that 

s w - -k D '- (104> 

where D* = idt + ed — YH is the operator complex conjugate to D. The above identity allows us to 
pull out a factor of 1/21-L 2 from the derivative operator D, and the expense of introducing a complex 
conjugate of D. 

We now consider the square of the evolution equation operator E: 



E2 = h Db ie Db - (105) 



We use (|104|) to convert D into D* and then use the fact that D and D* commute {D,D*} = 0. 
We then use the complex conjugate of the identity (|104p . Overall, we get the following sequence of 
transformations 

£ 2 = W D 2^°™ " " 2& DD "^ D " D ~ RR '- < 106 > 

where we have introduced 

1 

m 



R--——DD*. (107) 



Note that R is a dimensionless operator, since H carries the dimension of mass. The identity (|106p in 
particular implies that E 2 is a real operator, which is not at all obvious because E is not real. 



5.3 The reality condition 

In the case of the Dirac equation viewed as a reality condition for the spinors satisfying the Klein- 
Gordon equation, the Dirac equation appears as a "square root" of the Klein-Gordon. In our case we 
expect a second-order in derivatives reality condition, as follows from our general discussion in the 
Introduction. Thus, if it is to appear as a square root, it must be a square root of some fourth-order 
differential equation. 
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Now, as our relation (|106[) demonstrtes, in spite of the fact that the evolution equation (|103p is 
complex, we see that its square E 2 a = a, which is clearly implied by (|103p . is a real equation. This 
fourth order equation is not so interesting in itself, but introduces a new second-order differential 
operator R, such that E 2 = RR*. In other words, R is a "square root" of the real equation operator 
E 2 , similar to the Dirac operator being a square root of the Klein-Gordon one. It is then clear that if 
we define 

K = Roj, (108) 

which should be compared with f|23|) in the Introduction, then the reality condition 

Ka = a (109) 

is compatible with the evolution equation Ea = a. Indeed, the compatibility is just a rephrasal of the 
statement that on solutions of (|103p the 1Z anti-linear operator becomes an involution: 

U 2 = RR* = E 2 = Id, (110) 

where the last equation holds on the space of solutions Ea = a. Thus, 1Z is a real structure on the 
space of solutions, and the condition (|109p is a possible reality condition that can be imposed. Below 
we shall see that this is the physically correct condition, in particular by working out a relation to the 
metric description. The essence of (|109p will then be just a statement that the metric is real. 

It is worth emphasizing that all of the above happens in exact analogy with the case of Dirac 
equation, except that now the relevant "Dirac" operator is second order, and appears as a square 
root of the fourth-order operator obtained by squaring the evolution operator. This squaring of the 
evolution equation procedure is absent in the fermionic case, where the condition that the square of 
the 1Z operation is an identity is identical to the evolution equation. In our case this is not possible 
because the involution condition is necessarily fourth-order, and so it must be related to the evolution 
operator in a more non-trivial way (jllOp . 



5.4 Metric 

We can now rephrase the condition (|109p as a statement that a certain quantity is real. Indeed, we 
introduce 

h = -^Da, (111) 
V2M 

where the prefactor is introduced for convenience and also in order to give h the same mass dimension 
as a. Below we will show that h can be viewed as just a possible new configuration variable on the 
phase space of the theory, with the Hamiltonian form action principle in terms of this variable taking 
an explicitly real form (|124|) . 

The evolution equation in its form (|103p can now be rephrased by saying that it gives the inverse 
relation 

a= =- Dh. (112) 

Taking now the hermitian (complex) conjugate of the quantity h in (jllip . requiring it to be real 

/i f = h, (113) 

and then substituting h = Da/y2M into (|112p we get precisely the reality condition (|109p . Thus, 
the essence of the condition (|109p imposed on the space of solutions Ea = a of our theory is indeed 
in the statement that the quantity (jllip is real. We note that this interpretation of the reality 
condition in terms of some quantity being real is not present in the case of the Dirac equation. Such 
an interpretation became possible because our reality condition is second order in derivatives, unlike 
the first order Dirac equation (=reality condition). 
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5.5 Evolution equation for the metric 

As the last result of this section, let us use the identities derived above to obtain an evolution equation 
for the variable h. It is not hard to see that this equation is 

-^D*Dh = h. (114) 



2% 

Indeed, using (|104p we can rewrite this as 

D ^2 Dh = h or D^DDa = Da, (115) 

where to obtain the last equation we have used the relation (jllip . The equation obtained is just the 
evolution equation Ea = a with the operator D applied to it. Thus, (j!14|) clearly follows from (|100|) , 
It is also worth noting that it is a real equation, as is appropriate for a quantity that can consistently 
be assumed to be real. 

6 Canonical transformation to the metric variables 

The purpose of this section it to explicitly carry out the field redefinition (jllip and see that it can get 
completed (once the momentum variable is considered) into a canonical transformation on the phase 
space of the theory. The content of this section is new. 

6.1 Canonical transformation - momentum shift 

It is very convenient to eliminate the na cross-term in (|97p by shifting the momentum. Thus, we 
define 

7f = 7T - i(e0 + m)a. (116) 

Because of the last, time dependent (via T~L) term the transformation of the symplectic form gives rise 
to a contribution to the Hamiltonian. In other words, modulo surface terms we get 

V 2 

nd t = Trd t a + -^-a 2 , (117) 

where we have used %' = T~L 2 . We now drop the tilde from the momentum variable, and write the 
reduced action in the Hamiltonian form as 

£( 2 ) = J dt J d 3 x {ird t a - H) , (118) 

with the Hamiltonian given by 

H=^iT 2 + ^(eda + ma) 2 -^-a 2 . (119) 
The convenience of the new momentum variable lies in the fact that 

d t a = it. (120) 
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6.2 Canonical transformation to h variables 



From the previous section we know that we should be able to describe the dynamics in terms of the 
variable 

h = -^(m + (ed + m)a), (121) 

and that this variable can consistently be assumed to be real. The canonically conjugate momentum 
p to h is of course only defined modulo o-dependent shifts. However, if we insist that there is no ph 
terms in the resulting Hamiltonian, then the momentum variable can be determined to be given by 

We emphasize that this is a linear canonical transformation on the phase space of the theory. 
6.3 Metric Hamiltonian 

There are many contributions from the symplectic irdta term to the Hamiltonian in terms of h,p 
variables. After a rather tedious computation one finds that the action can be written as 



where 



J dt J d 3 x (pd t h - H) , (123) 



B-^ + @£=?*M>e. (124) 

As a check, we note that this Hamiltonian goes into that for a massless field in the limit M — > 0. 
Indeed, using the explicit expression (|127p for T~L one sees that T-L/M — > 1 when M — > 0. This shows 
that the above Hamiltonian has the correct Minkowski limit. As for the de Sitter Hamiltonian, the 
above is the standard Hamiltonian for the de Sitter space spin 2 part of the metric perturbation hav 
rescaled by the conformal factor c(t). 



6.4 Second-order formulation 

It is also instructive to write the above action in the second-order form, by integrating p out. We get 
= —M 2 f dt f d 3 x ^ (D*D - 2U 2 ) h = -M 2 f dt f d 3 x (^±^(Dh) 2 - h 2 ^j , (125) 

where we have integrated by parts in the (dth) 2 term to get the first expression for the action, which 
is explicitly real, and have used (|104p to get the second, more symmetric expression. The first version 
of the action clearly leads to (|114p as the corresponding Euler-Lagrange equation. 

It is worth emphasizing that the connection formalism linearized action (jlOip is actually simpler 
than the same action (j!25|) in the metric description. Here we are comparing only the completely 
symmetry reduced actions, but the same holds true also about the full linearized Lagrangians in 
the two formulations. The graviton gauge-theoretic Lagrangian (|79p is much simpler than its metric 
variant. And, although we do not discuss it in any length in this paper, the connection Lagrangian 
(|79p (in its Euclidean signature version where all fields are real) is actually a non-negative function in 
the space of fields, which is not the case for the Euclidean signature metric Lagrangian because of the 
conformal mode. We will give a more detailed comparison of the off-shell Lagrangians in the second 
paper of the series, when we work out the propagator. 
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7 Canonical quantization and the mode decomposition 



We now perform all the usual steps for the canonical quantization of the theory (jlOip . with the reality 
condition (|109p . Our main aim is to obtain a mode decomposition with correctly normalized creation 
and annihilation operators. The content of this section is new. 



7.1 Choice of the time coordinate 

We first explicitly solve the evolution equation (jlOUp for the connection, so that the linearly inde- 
pendent solutions later become the modes of the field. For this, let us first introduce a convenient 
parameterization of the c(t) and T~L functions. We choose 

c(t) = - — (126) 
w 1 - Mt ' 

so that c(0) = 1, i.e. we have chosen to origin of the time coordinate in such a way that t = 
corresponds to the conformal factor of unity. With this parameterization we get 

M 

H = -i TT- (127) 

1 - Mt K ' 

7.2 Spatial Fourier transform 

We now perform the spatial Fourier transform, and choose convenient polarization tensors. Thus, 
consider a mode of the form a % le . The transverse condition <9ja Jjf on the connection implies that the 
corresponding mode aV is orthogonal to k % . For this reason, it is very convenient to define 

z l {k) := /\k\, (128) 

i.e. a unit vector in the direction of the spatial momentum. We then define two (complex) vectors 
m l (k) , m' (k) that are both orthogonal to z l and whose only non-zero scalar product is m l rhi = 1. 
They satisfy 

ie l i k Zjmk = m^, ie i ^ k Zjfh] c = —rhi, ie^^rrijfhk = z%. (129) 

Here we have omitted the momentum dependence of these vectors for brevity, but it should all the 
time be kept in mind that they are k dependent. Thus, when we replace k — > —k the vectors m l ,m l 
get interchanged: 

m\-k) = m*(Jb), m*(-fc) = m\k). (130) 
It is very important to keep these transformations in mind for the manipulations that follow. 



7.3 Polarization tensors 

The fact that a* J is symmetric tracefree transverse implies that every mode e lkx comes in just two 
polarizations. For the corresponding polarization tensors it is convenient to choose m l {k)m :, {k) and 
in 1 (k)m J (k) . We shall refer to the mm mode as the negative helicity particle, while the mm mode will 
be referred to as the positive one. We will explain a reason for this choice below. 

Let us now consider the action of the operator ed on the two polarizations. We have 

(ed^m^^e^ = oj k m i m j a^e ikS , (ed)fh i m j ale lkx = -w^mW ' a\e lki ', (131) 
where we have introduced 

oj k := \k\. (132) 
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In other words, the two modes we have introduced are the eigenvectors of the operator ed with 
eigenvalues ±w k respectively. Our choice of the name for the mm mode as negative may seem unnatural 
at the moment (since it is the positive sign eigenvalue of ed). However, it becomes more natural if one 
computes the corresponding Weyl curvatures for the two modes. One finds that the negative mode 
has zero self-dual Weyl curvature, and is thus a purely anti-self-dual object. This is why it makes 
sense to refer to it as the negative helicity mode. 



7.4 Linearly independent solutions 

We now write the evolution equation (jlOOp as an equation for the time evolution of the Fourier 
coefficients. We get, for each of the modes 

d 2 a~ + (u 2 k + 2iHoj k - 2U 2 )a- = 0, d 2 t a\ + {u 2 k - 2iHu k - 2U 2 )a\ = 0. (133) 

Note that the positive helicity equation is just the complex conjugate of the negative helicity one. 

Each of the above equations is a second order ODE, and thus has a positive and negative frequency 
solutions. It is not at all hard to obtain then explicitly, and they read 

Ue-^\ a, ~ ( 1 - — - ^) , (134) 



It is interesting to note that one of the modes in each case is given by a rather simple expression, 
with the time-dependence of the amplitude being just that of T~L. The other mode in each case is 
more involved. For the negative mode it is the positive frequency solution that is simple, while for the 
positive mode the positive frequency solution is involved. This is a manifestation of a general pattern 
in our formalism, in that the negative helicity mode will always be much easier to deal with then the 
positive helicity one. 

Another point worth emphasizing is that one of the two linearly independent solutions of the 
connection evolution equation is actually simpler than the modes in the metric description, see (|142p 
below. This gives yet another illustration of the general statement that we would like to promote - 
the connection description is in many aspects simpler than the metric one. 



7.5 Action of the D operator on the modes 

It is useful to compute the action of the basic operator D on the modes (|134j) . We will need this when 
we impose the reality condition (|109p . which can be written as a = (l/2H 2 )D(Da)l We have 

Dm i m j 'He- iWkt+i%S = 2w fc m i m^e- i ^* +ife ( 1 + — V (135) 

V u k / 

DfhW^e-^"^ (l + — - ^] = -m l mi—e~^ t+i ^ + 

ri \ Uk 2u(J LO k V u kJ 

DmW-e 5 "**-^ ( 1 - — - ^] = mW— e^**-** ( 1 - — V 
rl V 2u(J uj k \ u) k J 

DTrfmPHJ"**-** 3 = -2w fe m i m%e i ^*- ife ( 1 - — ) . 
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Now, to impose the reality condition, we take the complex conjugates of the right-hand-sides, and 
then apply the operator D to them. We get 



2u k Drh i m>H^ Uht - i ^ (l-—\= (2w fc ) 2 mW'"He ilJfc *- ife ( 1 - — - ^] , (136) 
\ uJkJ \ u k 2uf,J 



k- 

14 - ( if/ \ 

■DmW-e iutl - iK 1 - — = mW-^-e 3 "**- 1 * 2 , 
\ UJkJ lo% 



Dm i m'-e- iut ' +iK 1 + — = mW^-e - ^*^ 1 * 2 , 



-2w fc D7fi i m^e- iaJfct+ife f 1 + — ^ = (2w fc ) 2 m i 7fi^e- i ^ t+ife ( 1 + — - . 
7.6 The mode expansion 

Using the above results, we can now write down a mode expansion satisfying the reality condition 
(HMD. We get 

a «(t )£? ) = /" f * Lya-^^W + tf^fa-lt^e"*^ f 1 - — - ^] (137) 
K ' ; 7 (2vr)32a; fe L fc J%j h % \ uj k 2u?J K ' 

-rh l miai^^e-^ t+ ^ (l + — - ^] - m'm? (a+V ■_^_ ( Mt-i&?' 



k H \ "k 2a;2y """" ^ 

Here all the vectors m l ,m l are fc-dependent, but this dependence is suppressed in order to have a 
compact expression. We could have chosen to put a plus sign in front of the positive helicity modes, 
but below we shall see that the above choice leads to a more symmetric expression for the metric mode 
expansion. 

Note that the reality condition makes it unnatural to put factors of M in front of the modes. 
Thus, as it stands, the expression (|137[) does not have the Minkowski limit M — > 0, because some 
terms go to zero in this limit, and some other terms blow up. This is one difference with e.g. the 
Major ana fermion, which has a very similar type of the mode expansion. However, in that case there 
is a massless m — > limit in which half of the modes are set to zero, but the other half survives and 
gives the mode expansion of the Weyl fermion. In our case the connection (|137|) does not admit the 
M -)• limit. 

We also note that in (|137p only the relative coefficient between the a, a' terms in each helicity 
sector is fixed by the reality condition, so we could have multiplied each sector by an arbitrary constant 
factor. By doing this we could obtain an expression that survives in the M — ¥ limit. However, we are 
now going to show that the mode decomposition (|137|) is written in terms of canonically normalized 
operators. We do this by computing the commutators as implied by the canonical Poisson brackets 
between the connection and its conjugate momentum. 

7.7 Commutators 

We start with the relation that the equal time connection and its conjugate momentum should satisfy: 

[aij(t,x),d t a k i{t,y)] = i5 3 (x - y)P ijk i- (138) 
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For the conjugate momentum we have 

d 3 p 



dt(iij(t,y) 



-lU) n 



(2vr) 3 2w p 

—fh t (p)fh :, (p)(a 



V2w p V Up J 



■ ' H 



V 

P iuj p t-ipy I 1 _ J±_ , 



jV^W,, i,.,_f_i^s? T~L Z iH 



UJ r , 



(139) 



Substituting this into (|138p . and using the fact that under k — > —k the vectors m ! ,ra' get interchanged, 
as well as the fact that for any k 



Pijkl = mimjrrikmi + mimjrn k m u 



we get 



[at,(at) ] \ = {^?^k5 i {k-p), 



(140) 



(141) 



which are the canonical commutational relations for the creation-annihilation operators in field theory. 
This gives one confirmation of the correct normalization used in (|137|) . Another confirmation comes 
by computing the metric, and then the associated Hamiltonian. 



7.8 Metric 



Let us now use (|137p to obtain the mode decomposition for the metric (jllip . The action of the 
operator D on all the modes has already been computed in (|135|) . We get 



h ij (t,x) 



H 



M J (2vr) 3 2o;fc 



rfik r - / YN 

{rrjm j a7 + m l m j a+)e~ iuJkt+ikx 1 + — 



(142) 



+(m l m J (a k y + m l m? (a^y) 



i m Hn + ^ t \ Ju k t-lkx 



1 



m 



This expression has an obvious (correct) Minkowski limit M — > 0. It is also explicitly hermitian. It 
is in order to obtain the above symmetric expression that we chose to introduce the minus signs in 
front of the positive helicity modes in (|137p . To compute the Hamiltonian in terms of the modes, let 
us also give an expression for the momentum p = (M 2 /T-L 2 )dth. We get 



U J (2vr) 3 2w fe 



''" k (-iw fc ) Um'mPal + fh i fh^at)e- i ^ t+ii£ ( 1 + — - ^ 

2YH 2U 2 



(143) 



-{m l m 3 {a k y + m l m 3 {a^ 



i rr,0( n + \t\J^kt-ikx 



The Hamiltonian (|124p then reads: 



H = - 

2 



(144) 



The Hamiltonian is explicitly time dependent, as is appropriate for particles in time-dependent de 
Sitter Universe where the energy is not conserved. We note that it has the correct Minkowski limit 
M -> 0. 
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8 Discrete symmetries 



In this section we obtain the action of the discrete C, P, T symmetries on the connection field, and 
on the creation-annihilation operators. 



8.1 Charge conjugation 

Our fields are "real", in the sense that we do not have independent operators in front of the positive 
and negative frequency modes. The metric is explicitly real. Thus, the charge conjugation acts trivially 
- all operators go into themselves. 



8.2 Parity 

We could obtain the action of parity from the mode expansion for the metric, which is standard. We 
could also just directly define the action on the operators. Indeed, parity changes the sign of the 
spatial momentum, and interchanges the two helicities: 

Pt a ±p = a T fc . ( 145 ) 

In view of f)142|) this is equivalent to 

P^h ij (t,x)P = h ij (t,-x). (146) 

It is much more interesting to obtain the parity action on the connection field. Using (|145p and the 
mode decomposition (|137f) we get 

P^a ij (t,x)P = -{a ij (t,-x))K (147) 

The minus sign in this formula can be interpreted as being related to the fact that we are dealing with 
the spatial connection, which changes sign under parity. But most importantly, we see that parity 
is related to the hermitian conjugation of the connection field operator. This is reminiscent of what 
happens in the case of fermions, where the parity at the level of 2-component spinors is also related 
to the hermitian conjugation of the spinor fields. 



8.3 Time reversal 

Time-dependent physics in de Sitter space is not time reversal invariant. However, it can be made to 
be such by simultaneously reversing the sign of the time coordinate and the sign of the parameter M. 
This sends one from one patch of de Sitter space (covered by the flat slicing) to another patch where 
the time flows in the opposite direction. Hence, it must be a symmetry of the theory. The action of 
the time reversal, which is an anti-linear operator, can then be obtained by requiring 

T^h ij (t,x)T = h ij (-t,x) . (148) 

This gives, at the level of the operators 

Tta±T = a± fc . (149) 

While parity flips the sign of the spatial momentum while leaving the particle's spin unchanged, which 
results in flipping of the helicity, time reversal flips both the momentum and the spin, which does not 
change helicity. At the level of the connection we get 

T^a ij (t,x)T = a ij (-t,x) . (150) 

M-»~Af 
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8.4 CPT 



We now combine all of the above transformation rules into the action of the CPT transformation. We 
see that, modulo an overall minus sign, this action is that of the spacetime inversion (t, x) —> —(t,x), 
as well as the hermitian conjugation of the field. This is of course standard in field theory. Note, 
however, that in our case the hermitian conjugation comes not from the charge conjugation, in spite 
of the fact that the field is complex. Rather, it is a part of the parity transformation. But the end 
result is the same: CPT is hermitian conjugation together with the spacetime inversion. This is 
the CPT theorem for our theories - a hermitian Lagrangian will be CPT invariant. At the same 
time, hermiticity of the Lagrangian is important for unitarity of the theory. While we have seen this 
hermiticity at the linearized level (e.g. by going to the metric description), the question whether there 
exists an appropriate real structure on the space of solutions of the full theory that allows a real 
section to be taken is open. 

9 Discussion 

Let us recap the main points of our construction. We have studied diffeomorphism-invariant gauge 
theories of the type ([26]) with the gauge group SL(2,C), with the aim of describing the linearized 
theory around a background connection that corresponds to the de Sitter space. We have seen that all 
theories of this type coincide at the linearized level, and describe massless spin 2 particles. We have 
also seen that the arising connection evolution equation is in general complex, with the imaginary part 
appearing with a factor of the Hubble parameter % in front. Thus, in a time-dependent background 
such as the one given by the de Sitter space, the connection cannot be taken to be real. We also gave 
general arguments to the same effect based on the fact that the (linearized) connection realizes an 
intrinsically complex spinor representation S+ ® S— of the Lorentz group. At the same time, we have 
seen that a real structure exists on the space of solutions, and that this can be used to select a real 
section in the phase space, on which one obtains a theory with a hermitian Hamiltonian. All this was 
shown to be quite analogous to the treatment of fermions in which they are described as complex fields 
satisfying the Klein-Gordon equation, with an additional first order in derivatives reality condition 
(Dirac equation) imposed. The main difference with the case of fermions was that in our case the 
reality condition was necessarily of the second order in derivatives. We have also seen that this second 
order nature of the reality conditions is what guarantees that a real (metric) description exists. 

We have avoided discussing the above statements in the spacetime form, staying all the time at 
the level of the phase space formulation. On one hand this makes things more clear. On the other 
hand, for path integral computations it is necessary to develop the spacetime version of the mode 
decomposition. This will be accomplished in the second paper of the series, where this formalism is 
used to compute the graviton scattering amplitudes. One of the reasons why this was not treated 
already in the present paper is that it requires a much more detailed introduction into the spinor 
techniques (e.g. spinor helicity), and this would take us too far from the present goal of expanding 
the connection into the canonically normalized creation-annihilation operators. 

Let us finish with a very brief list of the open problems of this approach. The one that is most 
directly related to the topics covered in this paper is that of unitarity. Thus, it is not clear if there 
exists a satisfactory way to select a real section of the non-linear dynamics described by a general 
theory from the class (|26p . However, the fact that this is possible in the linearized theory around such 
a time-dependent background as de Sitter, and the fact that at least for one of the theories from this 
class, namely GR, this is possible also at the full non-linear level, allows for optimism. 

The other major open problem of this approach is coupling to matter. Many types of bosonic 
matter can be coupled just by enlarging the gauge group, i.e. considering still theories of the same 
general class (|26|) . but with a larger G D SL(2,C). In particular, Yang-Mills fields, as well as e.g. 
a massive scalar field can be coupled this way naturally. A very interesting symmetry breaking 



30 



mechanism selecting what should be called the gravitational SL(2, C) then becomes available, see [9] 
for more details. However, the arising matter /gravity dynamics should be studied in more details, in 
particular with the reality conditions issues in mind. An open question is that of coupling of fermions. 
This seems difficult in the usual first-order in derivatives formalism, but it should also be kept in mind 
that the fermions can also be described via a second-order in derivatives action, with a first-order 
reality condition imposed, as described in more details in the Introduction. This brings fermions 
much close to what seems to be at work in the class of theories considered here, and raises hopes that 
they can be coupled satisfactorily. 

The third major open problem of this approach is renormalizability. It has been conjectured in 
that the class ([26]) with G = SL(2, C) is closed under renormalization. Work is in progress on testing 
this conjecture at one loop. Even if this turns out not to be the case for G = SL(2,C), it will still 
be possible that only for some specific choices of G the class of theories (|26p becomes renormalization 
closed. For example, this may be the case when G is an appropriate graded Lie group (i.e. a Lie 
supergroup). Such more general choices of G may in any case be necessary to describe fermionic 
particles with their anti-commuting Grassmann- valued fields. These various version of the conjecture 
[TTj should be tested, and the formalism developed here for G = SL(2, C) is a necessary prerequisite 
for computations of this type. 
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A Appendix: Self-dual two-forms 

For any self-dual two-form we have: 

\^v pa U pa = iU^, (151) 

and for anti-self-dual form we have an extra minus on the right-hand-side. The space of self-dual 
two- forms being 3-dimensional, we can introduce a basis in it. A choice of such basic self-dual two- 
forms can be rather arbitrary as long as they span the required subspace. However, there is always a 
canonical (modulo certain gauge rotations, see below) choice of the basis. Let us denote such canonical 
basis self-dual two-forms by E* i = 1,2,3. Note that we have denoted the index enumerating the 
two- forms by the same letter as was used to refer to the spatial index in the Hamiltonian analysis. 
This is not an oversight; the two indices can be naturally identified, see below. The canonical basic 
self-dual two-forms are defined to satisfy 

e^S^S^ = 8i^, (152) 

where the numerical coefficient on the right is convention-dependent, and 5 lJ is the Kronecker-delta. 
It can be shown that the self-dual two-forms satisfying (|152p are defined uniquely modulo SO(3) 
rotations preserving 5 lJ . We can now give an explicit form of the basic self-dual two- forms in the case 
of the Minkowski spacetime metric. Using the two-form notation we have: 

E* = idt A dx l + ^e ijk dx j A dx k . (153) 

it is not hard to check the EL, are self-dual (with the conventions that e 0123 = +1), and that (j!52|) 
holds. Let us also note what becomes of the components of the basis self-dual two-forms EL, under 
the space+time split. We have: 

K, i^- Ej fc = eV (154) 
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Thus, we see that the objects indeed provide a natural identification of the basis index i with the 
spatial index. Let us also note an important identity satisfied by our self-dual two- forms. We have 

Ej^Ejp = S^rj/ + e ijk Z k /. (155) 

Thus, the basic self-dual two-forms satisfy an algebra similar to that of Pauli matrices. This identity 
can be checked by direct verification, using the explicit expression (|153p . 
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